• Article  

      Algorithmic mechanisms for reliable master-worker internet-based computing 

      Christoforou, Evgenia; Fernández Anta, Antonio; Georgiou, Chryssis; Mosteiro, Miguel A. (2014)
      We consider Internet-based master-worker computations, where a master processor assigns, across the Internet, a computational task to a set of untrusted worker processors, and collects their responses. Examples of such ...
    • Conference Object  

      Analyzing the number of slow reads for semifast atomic read/write register implementations 

      Georgiou, Chryssis; Kentros, Sotirios; Nicolaou, Nicolas C.; Shvartsman, A. A. (2009)
      Developing fast implementations of atomic read/write registers in the message passing model is among the fundamental problems in distributed computing. Typical implementations require two communication round trips for read ...
    • Article  

      Coding approaches to fault tolerance in linear dynamic systems 

      Hadjicostis, Christoforos N.; Verghese, G. C. (2005)
      This paper discusses fault tolerance in discrete-time dynamic systems, such as finite-state controllers or computer simulations, with focus on the use of coding techniques to efficiently provide fault tolerance to linear ...
    • Conference Object  

      Confidential gossip 

      Georgiou, Chryssis; Gilbert, S.; Kowalski, D. R. (2011)
      Epidemic gossip has proven a reliable and efficient technique for sharing information in a distributed network. Much of the reliability and efficiency derives from processes collaborating, sharing the work of distributing ...
    • Conference Object  

      Coordinated cooperative work using undependable processors with unreliable broadcast 

      Davtyan, S.; Prisco, R. D.; Georgiou, Chryssis; Shvartsman, A. A. (IEEE Computer Society, 2014)
      With the end of Moore's Law in sight, parallelism became the main means for speeding up computationally intensive applications, especially in the cases where large collections of tasks need to be performed. Network ...
    • Conference Object  

      Decentralized fault accommodation of a class of interconnected nonlinear systems using an adaptive approximation approach 

      Panagi, P.; Polycarpou, Marios M. (2009)
      This paper presents a decentralized adaptive approximation design for the fault tolerant control of interconnected subsystems. We consider faults that occur in the subsystems local dynamics as well as in the interconnection ...
    • Article  

      Designs of bisimilar Petri net controllers with fault tolerance capabilities 

      Li, L.; Hadjicostis, Christoforos N.; Sreenivas, R. S. (2008)
      This paper proposes an approach for providing tolerance against faults that may compromise the functionality of a given controller modeled by a Petri net. The method is based on embedding the given Petri net controller ...
    • Article  

      Encoded dynamics for fault tolerance linear finite-state machines 

      Hadjicostis, Christoforos N.; Verghese, G. C. (2002)
      This note discusses fault tolerance in linear finite-state machines (LFSMs). By replacing a given LFSM with a larger, redundant LFSM that preserves the state of the original one in some linearly encoded form, various ...
    • Article  

      Error detection and correction in switched linear controllers via periodic and non-concurrent checks 

      Sundaram, S.; Hadjicostis, Christoforos N. (2006)
      Control systems that utilize switched linear controllers have proven to be useful (and, in some cases, essential) for accomplishing certain control objectives in particular classes of plants. These controllers are often ...
    • Article  

      Failure-sensitive analysis of parallel algorithms with controlled memory access concurrency 

      Georgiou, Chryssis; Russell, A.; Shvartsman, A. A. (2007)
      The abstract problem of using P failure-prone processors to cooperatively update all locations of an N-element shared array is called Write-All. Solutions to Write-All can be used iteratively to construct efficient simulations ...
    • Conference Object  

      Fault-tolerant discrete-time linear time-invariant filters 

      Hadjicostis, Christoforos N. (Institute of Electrical and Electronics Engineers Inc., 2000)
      Modular redundancy, the traditional approach to fault tolerance, is prohibitively expensive because of the overhead in replicating the hardware. In this paper we discuss alternative methods for obtaining fault-tolerant ...
    • Conference Object  

      Fault-tolerant linear finite state machines 

      Hadjicostis, Christoforos N.; Verghese, G. C. (Institute of Electrical and Electronics Engineers Inc., 1999)
      In this paper we develop a framework for constructing fault-tolerant dynamic systems, focusing primarily on linear finite state machines (LFSMs). Modular redundancy, the traditional approach to fault tolerance, is expensive ...
    • Article  

      Fault-tolerant semifast implementations of atomic read/write registers 

      Georgiou, Chryssis; Nicolaou, Nicolas C.; Shvartsman, A. A. (2009)
      This paper investigates time-efficient implementations of atomic read-write registers in message-passing systems where the number of readers can be unbounded. In particular we study the case of a single writer, multiple ...
    • Article  

      Finite-state machine embeddings for nonconcurrent error detection and identification 

      Hadjicostis, Christoforos N. (2005)
      In digital sequential systems that operate over several time steps, a state-transition fault at any time step during the operation of the system can corrupt its state and render its future functionality useless. Such ...
    • Article  

      A hybrid fault-tolerant algorithm for MPLS networks 

      Hadjiona, Maria; Georgiou, Chryssis; Papa, Maria; Vassiliou, Vasos (2008)
      In this paper we present a new fault tolerant, path maintaining, algorithm for use in MPLS based networks. The novelty of the algorithm lies upon the fact that it is the first to employ both path restoration mechanisms ...
    • Article  

      In-network data acquisition and replication in mobile sensor networks 

      Andreou, Panayiotis G.; Zeinalipour-Yazdi, Constantinos D.; Chrysanthis, Panos K.; Samaras, George S. (2011)
      This paper assumes a set of n mobile sensors that move in the Euclidean plane as a swarm. Our objectives are to explore a given geographic region by detecting and aggregating spatiooral events of interest and to store these ...
    • Article  

      Modeling the impact of permanent faults in caches 

      Sánchez, D.; Sazeides, Yiannakis; Cebrián, J. M.; Garćia, J. M.; Aragón, J. L. (2013)
      The traditional performance cost benefits we have enjoyed for decades from technology scaling are challenged by several critical constraints including reliability. Increases in static and dynamic variations are leading to ...
    • Article  

      Modelling control systems in an event-driven coordination language 

      Limniotes, Theophilos A.; Papadopoulos, George Angelos (2000)
      The paper presents the implementation of a railway control system, as a means of assessing the potential of coordination languages to be used for modelling software architectures for complex control systems using a ...
    • Article  

      MPI-FT: Portable fault tolerance scheme for MPI 

      Louca, Soulla P.; Neophytou, Neophytos; Lachanas, Adrianos; Evripidou, Paraskevas (2000)
      In this paper, we propose the design and development of a fault tolerant and recovery scheme for the Message Passing Interface (MPI). The proposed scheme consists of a detection mechanism for detecting process failures, ...
    • Conference Object  

      Nonconcurrent error correction in the presence of roundoff noise 

      Takos, G.; Hadjicostis, Christoforos N. (2006)
      This paper analyzes the effects of roundoff noise on our ability to nonconcurrently detect and identify transient faults that corrupt state variables during the operation of a fault-tolerant discrete-time (DT) linear ...